22,491 results on '"Gao, Yuan"'
Search Results
2. The Compatibility, Convergence, and Stability of Difference Schemes
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
3. Numerical Effect
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
4. Coupled Simplified Lattice Boltzmann Method Study on Thermal Flows
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
5. Variable Coefficients and Nonlinear Problems
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
6. Difference Scheme
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
7. Common Difference Schemes for Several Model Equations
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
8. Assessment and Validation of No-slip Boundary Conditions for the Discrete Unified Gas Kinetic Scheme
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
9. A Simplified Lattice Boltzmann Flux Solver of Multiphase Flows
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
10. Studying Drag Reduction of Square Cylinder Based on the LBM
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
11. Difference Schemes for Multi-dimensional Problems
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
12. Initial Boundary Value Problems
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
13. Applications of Finite Difference Methods in Fluid Mechanics
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
14. Computational Fluid Dynamics
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
15. Fundamentals of Lattice Boltzmann Method
- Author
-
Hou, Guoxiang, primary, Chen, Caikan, additional, Qin, Shenglei, additional, Gao, Yuan, additional, and Wang, Kai, additional
- Published
- 2024
- Full Text
- View/download PDF
16. Core Investigation of the International Continental Scientific Drilling Program from the Cretaceous Songliao Basin (SK-1/SK-2/SK-3), NE China
- Author
-
Gao, Yuan, primary, Gao, Youfeng, additional, and Ibarra, Daniel E., additional
- Published
- 2024
- Full Text
- View/download PDF
17. Digital Copyright Transaction Scheme Based on Blockchain Technology
- Author
-
Gao, Yuan, primary, Wen, Jin, additional, Miao, Peidong, additional, and Wang, Zhiqiang, additional
- Published
- 2024
- Full Text
- View/download PDF
18. Fuzzy PID Control of Brushless DC Motor Based on the Improved Bat Algorithm
- Author
-
Li, Jie, primary, Zhao, Wentian, additional, Bai, Jiahang, additional, Yang, Liu, additional, Zhang, Tianxu, additional, Gao, Yuan, additional, Hu, Man, additional, Wang, Ziyuan, additional, Liu, Yuhao, additional, and Lv, Jingzhi, additional
- Published
- 2024
- Full Text
- View/download PDF
19. Research on Network Security Situation Assessment Method
- Author
-
Gao, Yuan, primary, Wen, Jin, additional, Chen, Pu, additional, and Wang, Zhiqiang, additional
- Published
- 2024
- Full Text
- View/download PDF
20. Spin Supersolid Phase and Double Magnon-Roton Excitations in a Cobalt-based Triangular Lattice
- Author
-
Gao, Yuan, Zhang, Chuandi, Xiang, Junsen, Yu, Dehong, Lu, Xingye, Sun, Peijie, Jin, Wentao, Su, Gang, and Li, Wei
- Subjects
Condensed Matter - Strongly Correlated Electrons - Abstract
Supersolid is an exotic quantum state of matter that hosts spontaneously the features of both solid and superfluidity, which breaks the lattice translational symmetry and U(1) gauge symmetry. Here we conduct inelastic neutron scattering (INS) measurements and tensor-network calculations on the triangular-lattice cobaltate Na$_2$BaCo(PO$_4$)$_2$, which is proposed in [Xiang ${\it et al.}$, Nature 625, 270-275 (2024)] as a quantum magnetic analog of supersolid. We uncover characteristic dynamical signatures, which include distinct magnetic Bragg peaks indicating out-of-plane spin solidity and gapless Goldstone modes corresponding to the in-plane spin superfluidity, offering comprehensive spectroscopic evidence for spin supersolid in Na$_2$BaCo(PO$_4$)$_2$. We also compute spin dynamics of the easy-axis triangular-lattice model, and reveal magnon-roton excitations containing U(1) Goldstone and roton modes associated with the in-plane spin superfluidity, as well as pseudo-Goldstone and roton modes related to the out-of-plane spin solidity, rendering double magnon-roton dispersions in the spin supersolid. Akin to the role of phonon-roton dispersion in shaping the helium thermodynamics, the intriguing magnetic excitations also strongly influence the low-temperature thermodynamics of spin supersolid down to sub-Kelvin regime, explaining the recently observed giant magnetocaloric effect in Na$_2$BaCo(PO$_4$)$_2$.
- Published
- 2024
21. Carleman estimates for higher order partial differential operators and its applications
- Author
-
Fu, Xiaoyu and Gao, Yuan
- Subjects
Mathematics - Analysis of PDEs ,Mathematics - Optimization and Control - Abstract
In this paper, we obtain a Carleman estimate for the higher order partial differential operator. In the process of establishing this estimate, we developed a new method, which is called the back-propagation method (the BPM, for short). This method can also be used to build up Carleman estimates for some other partial differential operators, and might provide assistance with corresponding numerical analyses. As an application of the above-mentioned Carleman estimate, we proved the conditional stability of a Cauchy problem for a time fractional diffusion equation.
- Published
- 2024
22. A Latent Factor Model for High-Dimensional Binary Data
- Author
-
Shi, Jiaxin, Gao, Yuan, Pan, Rui, and Wang, Hansheng
- Subjects
Statistics - Methodology - Abstract
In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent variables. Our method assumes that the dependency structure of the observed binary data can be fully captured by the continuous latent factors. To estimate the model, a moment-based estimation method is developed. The proposed method is able to deal with both discontinuity and high dimensionality. Most importantly, the asymptotic properties of the resulting estimators are rigorously established. Extensive simulation studies are presented to demonstrate the proposed methodology. A real dataset about product descriptions is analysed for illustration.
- Published
- 2024
23. Vision-Language Model-based Physical Reasoning for Robot Liquid Perception
- Author
-
Lai, Wenqiang, Gao, Yuan, and Lam, Tin Lun
- Subjects
Computer Science - Robotics - Abstract
There is a growing interest in applying large language models (LLMs) in robotic tasks, due to their remarkable reasoning ability and extensive knowledge learned from vast training corpora. Grounding LLMs in the physical world remains an open challenge as they can only process textual input. Recent advancements in large vision-language models (LVLMs) have enabled a more comprehensive understanding of the physical world by incorporating visual input, which provides richer contextual information than language alone. In this work, we proposed a novel paradigm that leveraged GPT-4V(ision), the state-of-the-art LVLM by OpenAI, to enable embodied agents to perceive liquid objects via image-based environmental feedback. Specifically, we exploited the physical understanding of GPT-4V to interpret the visual representation (e.g., time-series plot) of non-visual feedback (e.g., F/T sensor data), indirectly enabling multimodal perception beyond vision and language using images as proxies. We evaluated our method using 10 common household liquids with containers of various geometry and material. Without any training or fine-tuning, we demonstrated that our method can enable the robot to indirectly perceive the physical response of liquids and estimate their viscosity. We also showed that by jointly reasoning over the visual and physical attributes learned through interactions, our method could recognize liquid objects in the absence of strong visual cues (e.g., container labels with legible text or symbols), increasing the accuracy from 69.0% -- achieved by the best-performing vision-only variant -- to 86.0%., Comment: 8 pages, 6 figures, submitted to IROS 2024
- Published
- 2024
24. Anchor-based Robust Finetuning of Vision-Language Models
- Author
-
Han, Jinwei, Lin, Zhiwen, Sun, Zhongyisun, Gao, Yingguo, Yan, Ke, Ding, Shouhong, Gao, Yuan, and Xia, Gui-Song
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We aim at finetuning a vision-language model without hurting its out-of-distribution (OOD) generalization. We address two types of OOD generalization, i.e., i) domain shift such as natural to sketch images, and ii) zero-shot capability to recognize the category that was not contained in the finetune data. Arguably, the diminished OOD generalization after finetuning stems from the excessively simplified finetuning target, which only provides the class information, such as ``a photo of a [CLASS]''. This is distinct from the process in that CLIP was pretrained, where there is abundant text supervision with rich semantic information. Therefore, we propose to compensate for the finetune process using auxiliary supervision with rich semantic information, which acts as anchors to preserve the OOD generalization. Specifically, two types of anchors are elaborated in our method, including i) text-compensated anchor which uses the images from the finetune set but enriches the text supervision from a pretrained captioner, ii) image-text-pair anchor which is retrieved from the dataset similar to pretraining data of CLIP according to the downstream task, associating with the original CLIP text with rich semantics. Those anchors are utilized as auxiliary semantic information to maintain the original feature space of CLIP, thereby preserving the OOD generalization capabilities. Comprehensive experiments demonstrate that our method achieves in-distribution performance akin to conventional finetuning while attaining new state-of-the-art results on domain shift and zero-shot learning benchmarks., Comment: CVPR2024
- Published
- 2024
25. A Monte-Carlo Simulation on Resonant Scattering of X-ray Line Emission in Supernova Remnants
- Author
-
Li, Yiping, Zhang, Gao-Yuan, Chen, Yang, Sun, Lei, and Zhang, Shuinai
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Resonant scattering (RS) of X-ray line emission in supernova remnants (SNRs) may modify the observed line profiles and fluxes and has potential impact on estimating the physical properties of the hot gas and hence on understanding the SNR physics, but has not been theoretically modeled ever. Here we present our Monte-Carlo simulation of RS effect on X-ray resonant-line emission, typified by O VII He$\alpha$ r line, from SNRs. We employ the physical conditions characterized by the Sedov-Taylor solution and some basic parameters similar to those in Cygnus Loop. We show that the impact of RS effect is most significant near the edge of the remnant. The line profiles are predicted to be asymmetric because of different temperatures and photon production efficiencies of the expanding gas at different radii. We also predict the surface brightness of the line emission would decrease in the outer projected region but is slightly enhanced in the inner. The G-ratio of the OVII He$\alpha$ triplet can be effectively elevated by RS in the outer region. We show that RS effect of the O VII He$\alpha$ r line in the southwestern boundary region of Cygnus Loop is non-negligible. The observed OVII G-ratio $\sim$1.8 of the region could be achieved with RS taken into account for properly elevated O abundance from the previous estimates. Additional simulation performed for the SNRs in ejecta-dominated phase like Cas A shows that RS in the shocked ejecta may have some apparently effects on the observational properties of oxygen resonant lines., Comment: 19 pages, 17 figures, accepted by ApJ
- Published
- 2024
26. On a result by Baillon, Bruck, and Reich
- Author
-
Bauschke, Heinz H. and Gao, Yuan
- Subjects
Mathematics - Functional Analysis ,Mathematics - Optimization and Control ,47H05, 47H09 (Primary) 47N10, 65K05, 90C25 (Secondary) - Abstract
It is well known that the iterates of an averaged nonexpansive mapping may only converge weakly to fixed point. A celebrated result by Baillon, Bruck, and Reich from 1978 yields strong convergence in the presence of linearity. In this paper, we extend this result to allow for flexible relaxation parameters. Examples are also provided to illustrate the results.
- Published
- 2024
27. Convergence of Continuous Normalizing Flows for Learning Probability Distributions
- Author
-
Gao, Yuan, Huang, Jian, Jiao, Yuling, and Zheng, Shurong
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,62G05, 68T07 - Abstract
Continuous normalizing flows (CNFs) are a generative method for learning probability distributions, which is based on ordinary differential equations. This method has shown remarkable empirical success across various applications, including large-scale image synthesis, protein structure prediction, and molecule generation. In this work, we study the theoretical properties of CNFs with linear interpolation in learning probability distributions from a finite random sample, using a flow matching objective function. We establish non-asymptotic error bounds for the distribution estimator based on CNFs, in terms of the Wasserstein-2 distance. The key assumption in our analysis is that the target distribution satisfies one of the following three conditions: it either has a bounded support, is strongly log-concave, or is a finite or infinite mixture of Gaussian distributions. We present a convergence analysis framework that encompasses the error due to velocity estimation, the discretization error, and the early stopping error. A key step in our analysis involves establishing the regularity properties of the velocity field and its estimator for CNFs constructed with linear interpolation. This necessitates the development of uniform error bounds with Lipschitz regularity control of deep ReLU networks that approximate the Lipschitz function class, which could be of independent interest. Our nonparametric convergence analysis offers theoretical guarantees for using CNFs to learn probability distributions from a finite random sample., Comment: 60 pages, 3 tables, and 3 figures
- Published
- 2024
28. Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
- Author
-
Lin, Xiao, Yang, Wenfei, Gao, Yuan, and Zhang, Tianzhu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Category-level 6D object pose estimation aims to estimate the rotation, translation and size of unseen instances within specific categories. In this area, dense correspondence-based methods have achieved leading performance. However, they do not explicitly consider the local and global geometric information of different instances, resulting in poor generalization ability to unseen instances with significant shape variations. To deal with this problem, we propose a novel Instance-Adaptive and Geometric-Aware Keypoint Learning method for category-level 6D object pose estimation (AG-Pose), which includes two key designs: (1) The first design is an Instance-Adaptive Keypoint Detection module, which can adaptively detect a set of sparse keypoints for various instances to represent their geometric structures. (2) The second design is a Geometric-Aware Feature Aggregation module, which can efficiently integrate the local and global geometric information into keypoint features. These two modules can work together to establish robust keypoint-level correspondences for unseen instances, thus enhancing the generalization ability of the model.Experimental results on CAMERA25 and REAL275 datasets show that the proposed AG-Pose outperforms state-of-the-art methods by a large margin without category-specific shape priors., Comment: Accepted to CVPR2024
- Published
- 2024
29. Linear dynamics and classical tests of the gravitational quantum field theory
- Author
-
Gao, Yuan-Kun, Huang, Da, Ma, Yong-Liang, Tang, Yong, Wu, Yue-Liang, and Zhou, Yu-Feng
- Subjects
General Relativity and Quantum Cosmology ,High Energy Physics - Phenomenology ,High Energy Physics - Theory - Abstract
We explore the new physics phenomena of gravidynamics governed by the inhomogeneous spin gauge symmetry based on the gravitational quantum field theory. Such a gravidynamics enables us to derive the generalized Einstein equation and an equation beyond it. To simplify the analyses, we linearize the dynamic equations of gravitational interaction by keeping terms up to the leading order in the dual gravigauge field. We then apply the linearized dynamic equations into two particular gravitational phenomena. First, we consider the linearized equations in the absence of source fields, which is shown to have five physical propagating polarizations as gravitational waves, i.e., two tensor modes, two vector modes, and one scalar, instead of two tensor polarizations in the general relativity. Second, we examine the Newtonian limit in which the gravitational fields and the matter source distribution are weak and static. By deriving the associated Poisson equation, we obtain the exact relation of the fundamental interaction coupling in the gravidynamics with the experimentally measured Newtonian constant. We also make use of nonrelativistic objects and relativistic photons to probe the Newtonian field configurations. In particular, the experiments from the gravitational deflection of light rays and the Shapiro time delay can place stringent constraints on the linearized gravidynamics in the gravitational quantum field theory., Comment: 7 pages, 1 figure
- Published
- 2024
- Full Text
- View/download PDF
30. MEDBind: Unifying Language and Multimodal Medical Data Embeddings
- Author
-
Gao, Yuan, Kim, Sangwook, Austin, David E, and McIntosh, Chris
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Medical vision-language pretraining models (VLPM) have achieved remarkable progress in fusing chest X-rays (CXR) with clinical texts, introducing image-text data binding approaches that enable zero-shot learning and downstream clinical tasks. However, the current landscape lacks the holistic integration of additional medical modalities, such as electrocardiograms (ECG). We present MEDBind (Medical Electronic patient recorD), which learns joint embeddings across CXR, ECG, and medical text. Using text data as the central anchor, MEDBind features tri-modality binding, delivering competitive performance in top-K retrieval, zero-shot, and few-shot benchmarks against established VLPM, and the ability for CXR-to-ECG zero-shot classification and retrieval. This seamless integration is achieved through combination of contrastive loss on modality-text pairs with our proposed contrastive loss function, Edge-Modality Contrastive Loss, fostering a cohesive embedding space for CXR, ECG, and text. Finally, we demonstrate that MEDBind can improve downstream tasks by directly integrating CXR and ECG embeddings into a large-language model for multimodal prompt tuning.
- Published
- 2024
31. Thermal Tensor Network Approach for Spin-Lattice Relaxation in Quantum Magnets
- Author
-
Xi, Ning, Gao, Yuan, Li, Chengchen, Liang, Shuang, Yu, Rong, Wang, Xiaoqun, and Li, Wei
- Subjects
Condensed Matter - Strongly Correlated Electrons - Abstract
Low-dimensional quantum magnets, particularly those with strong spin frustration, are characterized by their notable spin fluctuations. Nuclear magnetic resonance (NMR) serves as a sensitive probe of low-energy fluctuations that offers valuable insight into rich magnetic phases and emergent phenomena in quantum magnets. Although experimentally accessible, the numerical simulation of NMR relaxation rates, specifically the spin-lattice relaxation rate $1/T_1$, remains a significant challenge. Analytical continuation based on Monte Carlo calculations are hampered by the notorious negative sign for frustrated systems, and the real-time simulations incur significant costs to capture low-energy fluctuations. Here we propose computing the relaxation rate using thermal tensor networks (TTNs), which provides a streamlined approach by calculating its imaginary-time proxy. We showcase the accuracy and versatility of our methodology by applying it to one-dimensional spin chains and two-dimensional lattices, where we find that the critical exponents $\eta$ and $z\nu$ can be extracted from the low-temperature scalings of the simulated $1/T_1$ near quantum critical points. Our results also provide insights into the low-dimensional and frustrated magnetic materials, elucidating universal scaling behaviors in the Ising chain compound CoNb$_2$O$_6$ and revealing the renormalized classical behaviors in the triangular-lattice antiferromagnet Ba$_8$CoNb$_6$O$_{24}$. We apply the approach to effective model of the family of frustrated magnets AYbCh$_2$ (A = Na, K, Cs, and Ch = O, S, Se), and find dramatic changes from spin ordered to the proposed quantum spin liquid phase. Overall, with high reliability and accuracy, the TTN methodology offers a systematic strategy for studying the intricate dynamics observed across a broad spectrum of quantum magnets and related fields., Comment: 15 pages, 12 figures
- Published
- 2024
32. Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering
- Author
-
Gao, Yuan, Zhu, Yiheng, Cao, Yuanbin, Zhou, Yinzhi, Wu, Zhen, Chen, Yujie, Wu, Shenglan, Hu, Haoyuan, and Dai, Xinyu
- Subjects
Computer Science - Computation and Language - Abstract
Open Domain Multi-Hop Question Answering (ODMHQA) plays a crucial role in Natural Language Processing (NLP) by aiming to answer complex questions through multi-step reasoning over retrieved information from external knowledge sources. Recently, Large Language Models (LLMs) have demonstrated remarkable performance in solving ODMHQA owing to their capabilities including planning, reasoning, and utilizing tools. However, LLMs may generate off-topic answers when attempting to solve ODMHQA, namely the generated answers are irrelevant to the original questions. This issue of off-topic answers accounts for approximately one-third of incorrect answers, yet remains underexplored despite its significance. To alleviate this issue, we propose the Discriminate->Re-Compose->Re- Solve->Re-Decompose (Dr3) mechanism. Specifically, the Discriminator leverages the intrinsic capabilities of LLMs to judge whether the generated answers are off-topic. In cases where an off-topic answer is detected, the Corrector performs step-wise revisions along the reversed reasoning chain (Re-Compose->Re-Solve->Re-Decompose) until the final answer becomes on-topic. Experimental results on the HotpotQA and 2WikiMultiHopQA datasets demonstrate that our Dr3 mechanism considerably reduces the occurrence of off-topic answers in ODMHQA by nearly 13%, improving the performance in Exact Match (EM) by nearly 3% compared to the baseline method without the Dr3 mechanism., Comment: LREC-COLING 2024, Long Paper
- Published
- 2024
33. A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques
- Author
-
Li, Xuetong, Gao, Yuan, Chang, Hong, Huang, Danyang, Ma, Yingying, Pan, Rui, Qi, Haobo, Wang, Feifei, Wu, Shuyuan, Xu, Ke, Zhou, Jing, Zhu, Xuening, Zhu, Yingqiu, and Wang, Hansheng
- Subjects
Statistics - Methodology ,Computer Science - Machine Learning ,Mathematics - Statistics Theory ,Statistics - Computation - Abstract
This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first class of literature is about distributed computing and focuses on the situation, where the dataset size is too huge to be comfortably handled by one single computer. In this case, a distributed computation system with multiple computers has to be utilized. The second class of literature is about subsampling methods and concerns about the situation, where the sample size of dataset is small enough to be placed on one single computer but too large to be easily processed by its memory as a whole. The last class of literature studies those minibatch gradient related optimization techniques, which have been extensively used for optimizing various deep learning models.
- Published
- 2024
- Full Text
- View/download PDF
34. Enhancing Vision-Language Pre-training with Rich Supervisions
- Author
-
Gao, Yuan, Shi, Kunyu, Zhu, Pengkai, Belval, Edouard, Nuriel, Oren, Appalaraju, Srikar, Ghadar, Shabnam, Mahadevan, Vijay, Tu, Zhuowen, and Soatto, Stefano
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We propose Strongly Supervised pre-training with ScreenShots (S4) - a novel pre-training paradigm for Vision-Language Models using data from large-scale web screenshot rendering. Using web screenshots unlocks a treasure trove of visual and textual cues that are not present in using image-text pairs. In S4, we leverage the inherent tree-structured hierarchy of HTML elements and the spatial localization to carefully design 10 pre-training tasks with large scale annotated data. These tasks resemble downstream tasks across different domains and the annotations are cheap to obtain. We demonstrate that, compared to current screenshot pre-training objectives, our innovative pre-training method significantly enhances performance of image-to-text model in nine varied and popular downstream tasks - up to 76.1% improvements on Table Detection, and at least 1% on Widget Captioning., Comment: Accepted to CVPR 2024
- Published
- 2024
35. Non-Convex Stochastic Composite Optimization with Polyak Momentum
- Author
-
Gao, Yuan, Rodomanov, Anton, and Stich, Sebastian U.
- Subjects
Mathematics - Optimization and Control ,Computer Science - Machine Learning - Abstract
The stochastic proximal gradient method is a powerful generalization of the widely used stochastic gradient descent (SGD) method and has found numerous applications in Machine Learning. However, it is notoriously known that this method fails to converge in non-convex settings where the stochastic noise is significant (i.e. when only small or bounded batch sizes are used). In this paper, we focus on the stochastic proximal gradient method with Polyak momentum. We prove this method attains an optimal convergence rate for non-convex composite optimization problems, regardless of batch size. Additionally, we rigorously analyze the variance reduction effect of the Polyak momentum in the composite optimization setting and we show the method also converges when the proximal step can only be solved inexactly. Finally, we provide numerical experiments to validate our theoretical results.
- Published
- 2024
36. Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding
- Author
-
Liu, Guangyi, Wang, Yu, Feng, Zeyu, Wu, Qiyu, Tang, Liping, Gao, Yuan, Li, Zhen, Cui, Shuguang, McAuley, Julian, Xing, Eric P., Yang, Zichao, and Hu, Zhiting
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
The vast applications of deep generative models are anchored in three core capabilities -- generating new instances, reconstructing inputs, and learning compact representations -- across various data types, such as discrete text/protein sequences and continuous images. Existing model families, like Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), autoregressive models, and diffusion models, generally excel in specific capabilities and data types but fall short in others. We introduce generalized diffusion with learnable encoder-decoder (DiLED), that seamlessly integrates the core capabilities for broad applicability and enhanced performance. DiLED generalizes the Gaussian noising-denoising in standard diffusion by introducing parameterized encoding-decoding. Crucially, DiLED is compatible with the well-established diffusion model objective and training recipes, allowing effective learning of the encoder-decoder parameters jointly with diffusion. By choosing appropriate encoder/decoder (e.g., large language models), DiLED naturally applies to different data types. Extensive experiments on text, proteins, and images demonstrate DiLED's flexibility to handle diverse data and tasks and its strong improvement over various existing models.
- Published
- 2024
37. Few-Shot Learning for Annotation-Efficient Nucleus Instance Segmentation
- Author
-
Ming, Yu, Wu, Zihao, Yang, Jie, Li, Danyi, Gao, Yuan, Gao, Changxin, Xia, Gui-Song, Li, Yuanqing, Liang, Li, and Yu, Jin-Gang
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Nucleus instance segmentation from histopathology images suffers from the extremely laborious and expert-dependent annotation of nucleus instances. As a promising solution to this task, annotation-efficient deep learning paradigms have recently attracted much research interest, such as weakly-/semi-supervised learning, generative adversarial learning, etc. In this paper, we propose to formulate annotation-efficient nucleus instance segmentation from the perspective of few-shot learning (FSL). Our work was motivated by that, with the prosperity of computational pathology, an increasing number of fully-annotated datasets are publicly accessible, and we hope to leverage these external datasets to assist nucleus instance segmentation on the target dataset which only has very limited annotation. To achieve this goal, we adopt the meta-learning based FSL paradigm, which however has to be tailored in two substantial aspects before adapting to our task. First, since the novel classes may be inconsistent with those of the external dataset, we extend the basic definition of few-shot instance segmentation (FSIS) to generalized few-shot instance segmentation (GFSIS). Second, to cope with the intrinsic challenges of nucleus segmentation, including touching between adjacent cells, cellular heterogeneity, etc., we further introduce a structural guidance mechanism into the GFSIS network, finally leading to a unified Structurally-Guided Generalized Few-Shot Instance Segmentation (SGFSIS) framework. Extensive experiments on a couple of publicly accessible datasets demonstrate that, SGFSIS can outperform other annotation-efficient learning baselines, including semi-supervised learning, simple transfer learning, etc., with comparable performance to fully supervised learning with less than 5% annotations.
- Published
- 2024
38. AoSRNet: All-in-One Scene Recovery Networks via Multi-knowledge Integration
- Author
-
Lu, Yuxu, Yang, Dong, Gao, Yuan, Liu, Ryan Wen, Liu, Jun, and Guo, Yu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Scattering and attenuation of light in no-homogeneous imaging media or inconsistent light intensity will cause insufficient contrast and color distortion in the collected images, which limits the developments such as vision-driven smart urban, autonomous vehicles, and intelligent robots. In this paper, we propose an all-in-one scene recovery network via multi-knowledge integration (termed AoSRNet) to improve the visibility of imaging devices in typical low-visibility imaging scenes (e.g., haze, sand dust, and low light). It combines gamma correction (GC) and optimized linear stretching (OLS) to create the detail enhancement module (DEM) and color restoration module (CRM). Additionally, we suggest a multi-receptive field extraction module (MEM) to attenuate the loss of image texture details caused by GC nonlinear and OLS linear transformations. Finally, we refine the coarse features generated by DEM, CRM, and MEM through Encoder-Decoder to generate the final restored image. Comprehensive experimental results demonstrate the effectiveness and stability of AoSRNet compared to other state-of-the-art methods. The source code is available at \url{https://github.com/LouisYuxuLu/AoSRNet}.
- Published
- 2024
39. EXGC: Bridging Efficiency and Explainability in Graph Condensation
- Author
-
Fang, Junfeng, Li, Xinglin, Sui, Yongduo, Gao, Yuan, Zhang, Guibin, Wang, Kun, Wang, Xiang, and He, Xiangnan
- Subjects
Computer Science - Machine Learning - Abstract
Graph representation learning on vast datasets, like web data, has made significant strides. However, the associated computational and storage overheads raise concerns. In sight of this, Graph condensation (GCond) has been introduced to distill these large real datasets into a more concise yet information-rich synthetic graph. Despite acceleration efforts, existing GCond methods mainly grapple with efficiency, especially on expansive web data graphs. Hence, in this work, we pinpoint two major inefficiencies of current paradigms: (1) the concurrent updating of a vast parameter set, and (2) pronounced parameter redundancy. To counteract these two limitations correspondingly, we first (1) employ the Mean-Field variational approximation for convergence acceleration, and then (2) propose the objective of Gradient Information Bottleneck (GDIB) to prune redundancy. By incorporating the leading explanation techniques (e.g., GNNExplainer and GSAT) to instantiate the GDIB, our EXGC, the Efficient and eXplainable Graph Condensation method is proposed, which can markedly boost efficiency and inject explainability. Our extensive evaluations across eight datasets underscore EXGC's superiority and relevance. Code is available at https://github.com/MangoKiller/EXGC.
- Published
- 2024
40. DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation
- Author
-
Gao, Yuan, Chen, Haokun, Wang, Xiang, Wang, Zhicai, Wang, Xue, Gao, Jinyang, and Ding, Bolin
- Subjects
Quantitative Finance - Statistical Finance ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Machine learning models have demonstrated remarkable efficacy and efficiency in a wide range of stock forecasting tasks. However, the inherent challenges of data scarcity, including low signal-to-noise ratio (SNR) and data homogeneity, pose significant obstacles to accurate forecasting. To address this issue, we propose a novel approach that utilizes artificial intelligence-generated samples (AIGS) to enhance the training procedures. In our work, we introduce the Diffusion Model to generate stock factors with Transformer architecture (DiffsFormer). DiffsFormer is initially trained on a large-scale source domain, incorporating conditional guidance so as to capture global joint distribution. When presented with a specific downstream task, we employ DiffsFormer to augment the training procedure by editing existing samples. This editing step allows us to control the strength of the editing process, determining the extent to which the generated data deviates from the target domain. To evaluate the effectiveness of DiffsFormer augmented training, we conduct experiments on the CSI300 and CSI800 datasets, employing eight commonly used machine learning models. The proposed method achieves relative improvements of 7.2% and 27.8% in annualized return ratio for the respective datasets. Furthermore, we perform extensive experiments to gain insights into the functionality of DiffsFormer and its constituent components, elucidating how they address the challenges of data scarcity and enhance the overall model performance. Our research demonstrates the efficacy of leveraging AIGS and the DiffsFormer architecture to mitigate data scarcity in stock forecasting tasks.
- Published
- 2024
41. Weaver: Foundation Models for Creative Writing
- Author
-
Wang, Tiannan, Chen, Jiamin, Jia, Qingrui, Wang, Shuai, Fang, Ruoyu, Wang, Huilin, Gao, Zhaowei, Xie, Chunzhao, Xu, Chuou, Dai, Jihong, Liu, Yibin, Wu, Jialong, Ding, Shengwei, Li, Long, Huang, Zhiwei, Deng, Xinle, Yu, Teng, Ma, Gangan, Xiao, Han, Chen, Zixin, Xiang, Danjun, Wang, Yunxia, Zhu, Yuanyuan, Xiao, Yi, Wang, Jing, Wang, Yiru, Ding, Siran, Huang, Jiayang, Xu, Jiayi, Tayier, Yilihamu, Hu, Zhenyu, Gao, Yuan, Zheng, Chengfeng, Ye, Yueshu, Li, Yihang, Wan, Lei, Jiang, Xinyue, Wang, Yujie, Cheng, Siyu, Song, Zhule, Tang, Xiangru, Xu, Xiaohua, Zhang, Ningyu, Chen, Huajun, Jiang, Yuchen Eleanor, and Zhou, Wangchunshu
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suit of novel methods for instruction data synthesis and LLM alignment, making it able to produce more human-like texts and follow more diverse instructions for content creation. The Weaver family consists of models of Weaver Mini (1.8B), Weaver Base (6B), Weaver Pro (14B), and Weaver Ultra (34B) sizes, suitable for different applications and can be dynamically dispatched by a routing agent according to query complexity to balance response quality and computation cost. Evaluation on a carefully curated benchmark for assessing the writing capabilities of LLMs shows Weaver models of all sizes outperform generalist LLMs several times larger than them. Notably, our most-capable Weaver Ultra model surpasses GPT-4, a state-of-the-art generalist LLM, on various writing scenarios, demonstrating the advantage of training specialized LLMs for writing purposes. Moreover, Weaver natively supports retrieval-augmented generation (RAG) and function calling (tool usage). We present various use cases of these abilities for improving AI-assisted writing systems, including integration of external knowledge bases, tools, or APIs, and providing personalized writing assistance. Furthermore, we discuss and summarize a guideline and best practices for pre-training and fine-tuning domain-specific LLMs.
- Published
- 2024
42. Alleviating Structural Distribution Shift in Graph Anomaly Detection
- Author
-
Gao, Yuan, Wang, Xiang, He, Xiangnan, Liu, Zhenguang, Feng, Huamin, and Zhang, Yongdong
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Graph anomaly detection (GAD) is a challenging binary classification problem due to its different structural distribution between anomalies and normal nodes -- abnormal nodes are a minority, therefore holding high heterophily and low homophily compared to normal nodes. Furthermore, due to various time factors and the annotation preferences of human experts, the heterophily and homophily can change across training and testing data, which is called structural distribution shift (SDS) in this paper. The mainstream methods are built on graph neural networks (GNNs), benefiting the classification of normals from aggregating homophilous neighbors, yet ignoring the SDS issue for anomalies and suffering from poor generalization. This work solves the problem from a feature view. We observe that the degree of SDS varies between anomalies and normal nodes. Hence to address the issue, the key lies in resisting high heterophily for anomalies meanwhile benefiting the learning of normals from homophily. We tease out the anomaly features on which we constrain to mitigate the effect of heterophilous neighbors and make them invariant. We term our proposed framework as Graph Decomposition Network (GDN). Extensive experiments are conducted on two benchmark datasets, and the proposed framework achieves a remarkable performance boost in GAD, especially in an SDS environment where anomalies have largely different structural distribution across training and testing environments. Codes are open-sourced in https://github.com/blacksingular/wsdm_GDN., Comment: Accepted to WSDM 2023
- Published
- 2024
- Full Text
- View/download PDF
43. New type of solutions for Schr\'odinger equations with critical growth
- Author
-
Gao, Yuan and Guo, Yuxia
- Subjects
Mathematics - Analysis of PDEs ,Mathematics - Functional Analysis ,35A01, 35B33, 35B38 - Abstract
We consider the following nonlinear Schr\"odinger equations with critical growth: \begin{equation} - \Delta u + V(|y|)u=u^{\frac{N+2}{N-2}},\quad u>0 \ \ \mbox{in} \ \mathbb {R}^N, \end{equation} where $V(|y|)$ is a bounded positive radial function in $C^1$, $N\ge 5$. By using a finite reduction argument, we show that if $r^2V(r)$ has either an isolated local maximum or an isolated minimum at $r_0>0$ with $V(r_0)>0$, there exists infinitely many non-radial large energy solutions which are invariant under some sub-groups of $O(3)$., Comment: 38 pages, 0 figures
- Published
- 2024
44. Mean Field Games for Controlling Coherent Structures in Nonlinear Fluid Systems
- Author
-
Gao, Yuan and Qi, Di
- Subjects
Mathematics - Optimization and Control ,Mathematics - Numerical Analysis - Abstract
This paper discusses the control of coherent structures in turbulent flows, which has broad applications among complex systems in science and technology. Mean field games have been proved a powerful tool and are proposed here to control the stochastic Lagrangian tracers as players tracking the flow field. We derive optimal control solutions for general nonlinear fluid systems using mean field game models, and develop computational algorithms to efficiently solve the resulting coupled forward and backward mean field system. A precise link is established for the control of Lagrangian tracers and the scalar vorticity field based on the functional Hamilton-Jacobi equations derived from the mean field models. New iterative numerical strategy is then constructed to compute the optimal solution with fast convergence. We verify the skill of the mean field control models and illustrate their practical efficiency on a prototype model modified from the viscous Burger's equation under various cost functions in both deterministic and stochastic formulations. The good model performance implies potential effectiveness of the strategy for more general high-dimensional turbulent systems., Comment: 26 pages, 8 figures
- Published
- 2024
45. MvKSR: Multi-view Knowledge-guided Scene Recovery for Hazy and Rainy Degradation
- Author
-
Yang, Dong, Xu, Wenyu, Gao, Yuan, Lu, Yuxu, Zhang, Jingming, and Guo, Yu
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
High-quality imaging is crucial for ensuring safety supervision and intelligent deployment in fields like transportation and industry. It enables precise and detailed monitoring of operations, facilitating timely detection of potential hazards and efficient management. However, adverse weather conditions, such as atmospheric haziness and precipitation, can have a significant impact on image quality. When the atmosphere contains dense haze or water droplets, the incident light scatters, leading to degraded captured images. This degradation is evident in the form of image blur and reduced contrast, increasing the likelihood of incorrect assessments and interpretations by intelligent imaging systems (IIS). To address the challenge of restoring degraded images in hazy and rainy conditions, this paper proposes a novel multi-view knowledge-guided scene recovery network (termed MvKSR). Specifically, guided filtering is performed on the degraded image to separate high/low-frequency components. Subsequently, an en-decoder-based multi-view feature coarse extraction module (MCE) is used to coarsely extract features from different views of the degraded image. The multi-view feature fine fusion module (MFF) will learn and infer the restoration of degraded images through mixed supervision under different views. Additionally, we suggest an atrous residual block to handle global restoration and local repair in hazy/rainy/mixed scenes. Extensive experimental results demonstrate that MvKSR outperforms other state-of-the-art methods in terms of efficiency and stability for restoring degraded scenarios in IIS.
- Published
- 2024
46. Magnon Damping Minimum and Logarithmic Scaling in a Kondo-Heisenberg Model
- Author
-
Gao, Yuan, Wang, Junsen, Li, Qiaoyi, Yan, Qing-Bo, Shi, Tao, and Li, Wei
- Subjects
Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Statistical Mechanics - Abstract
Recently, an anomalous temperature evolution of spin wave excitations has been observed in a van der Waals metallic ferromagnet Fe$_3$GeTe$_2$ (FGT) [S. Bao, et al., Phys. Rev. X 12, 011022 (2022)], whose theoretical understanding yet remains elusive. Here we study the spin dynamics of a ferromagnetic Kondo-Heisenberg lattice model at finite temperature, and propose a mechanism of magnon damping that explains the intriguing experimental results. In particular, we find the magnon damping rate $\gamma(T)$ firstly decreases as temperature lowers, due to the reduced magnon-magnon scatterings. It then reaches a minimum at $T_{\rm d}^*$, and rises up again following a logarithmic scaling $\gamma(T) \sim \ln{(T_0/T)}$ (with $T_0$ a constant) for $T < T_{\rm d}^*$, which can be attributed to electron-magnon scatterings of spin-flip type. Moreover, we obtain the phase diagram containing the ferromagnetic and Kondo insulator phases by varying the Kondo coupling, which may be relevant for experiments on pressured FGT. The presence of a magnon damping minimum and logarithmic scaling at low temperature indicates the emergence of the Kondo effect reflected in the collective excitations of local moments in a Kondo lattice system.
- Published
- 2024
47. I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
- Author
-
Guo, Xun, Zheng, Mingwu, Hou, Liang, Gao, Yuan, Deng, Yufan, Wan, Pengfei, Zhang, Di, Liu, Yufan, Hu, Weiming, Zha, Zhengjun, Huang, Haibin, and Ma, Chongyang
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Text-guided image-to-video (I2V) generation aims to generate a coherent video that preserves the identity of the input image and semantically aligns with the input prompt. Existing methods typically augment pretrained text-to-video (T2V) models by either concatenating the image with noised video frames channel-wise before being fed into the model or injecting the image embedding produced by pretrained image encoders in cross-attention modules. However, the former approach often necessitates altering the fundamental weights of pretrained T2V models, thus restricting the model's compatibility within the open-source communities and disrupting the model's prior knowledge. Meanwhile, the latter typically fails to preserve the identity of the input image. We present I2V-Adapter to overcome such limitations. I2V-Adapter adeptly propagates the unnoised input image to subsequent noised frames through a cross-frame attention mechanism, maintaining the identity of the input image without any changes to the pretrained T2V model. Notably, I2V-Adapter only introduces a few trainable parameters, significantly alleviating the training cost and also ensures compatibility with existing community-driven personalized models and control tools. Moreover, we propose a novel Frame Similarity Prior to balance the motion amplitude and the stability of generated videos through two adjustable control coefficients. Our experimental results demonstrate that I2V-Adapter is capable of producing high-quality videos. This performance, coupled with its agility and adaptability, represents a substantial advancement in the field of I2V, particularly for personalized and controllable applications.
- Published
- 2023
48. Observation of Magnon Damping Minimum Induced by Kondo Coupling in a van der Waals Ferromagnet Fe$_{3-x}$GeTe$_{2}$
- Author
-
Bao, Song, Wang, Junsen, Yano, Shin-ichiro, Shangguan, Yanyan, Huang, Zhentao, Liao, Junbo, Wang, Wei, Gao, Yuan, Zhang, Bo, Cheng, Shufan, Xu, Hao, Dong, Zhao-Yang, Yu, Shun-Li, Li, Wei, Li, Jian-Xin, and Wen, Jinsheng
- Subjects
Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Superconductivity - Abstract
In heavy-fermion systems with $f$ electrons, there is an intricate interplay between Kondo screening and magnetic correlations, which can give rise to various exotic phases. Recently, similar interplay appears to also occur in $d$-electron systems, but the underlying mechanism remains elusive. Here, using inelastic neutron scattering, we investigate the temperature evolution of the low-energy spin waves in a metallic van der Waals ferromagnet Fe$_{3-x}$GeTe$_{2}$ (Curie temperature $T_{\rm C}\sim160$ K), where the Kondo-lattice behavior emerges in the ferromagnetic phase below a characteristic temperature $T^*\sim90$ K. We observe that the magnon damping constant diverges at both low and high temperatures, exhibiting a minimum coincidentally around $T^*$. Such an observation is analogous to the resistivity minimum as due to the single-impurity Kondo effect. This unusual behavior is described by a formula that combines logarithmic and power terms, representing the dominant contributions from Kondo screening and thermal fluctuations, respectively. Furthermore, we find that the magnon damping increases with momentum below $T_{\rm C}$. These findings can be explained by considering spin-flip electron-magnon scattering, which serves as a magnonic analog of the Kondo-impurity scattering, and thus provides a measure of the Kondo coupling through magnons. Our results provide critical insights into how Kondo coupling manifests itself in a system with magnetic ordering and shed light on the coexistence of and interplay between magnetic order and Kondo effect in itinerant 3$d$-electron systems., Comment: 8 pages, 4 figures
- Published
- 2023
49. Inferring Hybrid Neural Fluid Fields from Videos
- Author
-
Yu, Hong-Xing, Zheng, Yang, Gao, Yuan, Deng, Yitong, Zhu, Bo, and Wu, Jiajun
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics - Abstract
We study recovering fluid density and velocity from sparse multiview videos. Existing neural dynamic reconstruction methods predominantly rely on optical flows; therefore, they cannot accurately estimate the density and uncover the underlying velocity due to the inherent visual ambiguities of fluid velocity, as fluids are often shapeless and lack stable visual features. The challenge is further pronounced by the turbulent nature of fluid flows, which calls for properly designed fluid velocity representations. To address these challenges, we propose hybrid neural fluid fields (HyFluid), a neural approach to jointly infer fluid density and velocity fields. Specifically, to deal with visual ambiguities of fluid velocity, we introduce a set of physics-based losses that enforce inferring a physically plausible velocity field, which is divergence-free and drives the transport of density. To deal with the turbulent nature of fluid velocity, we design a hybrid neural velocity representation that includes a base neural velocity field that captures most irrotational energy and a vortex particle-based velocity that models residual turbulent velocity. We show that our method enables recovering vortical flow details. Our approach opens up possibilities for various learning and reconstruction applications centered around 3D incompressible flow, including fluid re-simulation and editing, future prediction, and neural dynamic scene composition. Project website: https://kovenyu.com/HyFluid/, Comment: NeurIPS 2023. Project website: https://kovenyu.com/HyFluid/ The first two authors contribute equally
- Published
- 2023
50. Bayesian Optimization Algorithms for Accelerator Physics
- Author
-
Roussel, Ryan, Edelen, Auralee L., Boltz, Tobias, Kennedy, Dylan, Zhang, Zhe, Ji, Fuhao, Huang, Xiaobiao, Ratner, Daniel, Garcia, Andrea Santamaria, Xu, Chenran, Kaiser, Jan, Pousa, Angel Ferran, Eichler, Annika, Lubsen, Jannis O., Isenberg, Natalie M., Gao, Yuan, Kuklev, Nikita, Martinez, Jose, Mustapha, Brahim, Kain, Verena, Lin, Weijian, Liuzzo, Simone Maria, John, Jason St., Streeter, Matthew J. V., Lehe, Remi, and Neiswanger, Willie
- Subjects
Physics - Accelerator Physics - Abstract
Accelerator physics relies on numerical algorithms to solve optimization problems in online accelerator control and tasks such as experimental design and model calibration in simulations. The effectiveness of optimization algorithms in discovering ideal solutions for complex challenges with limited resources often determines the problem complexity these methods can address. The accelerator physics community has recognized the advantages of Bayesian optimization algorithms, which leverage statistical surrogate models of objective functions to effectively address complex optimization challenges, especially in the presence of noise during accelerator operation and in resource-intensive physics simulations. In this review article, we offer a conceptual overview of applying Bayesian optimization techniques towards solving optimization problems in accelerator physics. We begin by providing a straightforward explanation of the essential components that make up Bayesian optimization techniques. We then give an overview of current and previous work applying and modifying these techniques to solve accelerator physics challenges. Finally, we explore practical implementation strategies for Bayesian optimization algorithms to maximize their performance, enabling users to effectively address complex optimization challenges in real-time beam control and accelerator design.
- Published
- 2023
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.